Unsupervised Visual Representation Learning by Graph-Based Consistent Constraints
نویسندگان
چکیده
Learning rich visual representations often require training on datasets of millions of manually annotated examples. This substantially limits the scalability of learning effective representations as labeled data is expensive or scarce. In this paper, we address the problem of unsupervised visual representation learning from a large, unlabeled collection of images. By representing each image as a node and each nearest-neighbor matching pair as an edge, our key idea is to leverage graph-based analysis to discover positive and negative image pairs (i.e., pairs belonging to the same and different visual categories). Specifically, we propose to use a cycle consistency criterion for mining positive pairs and geodesic distance in the graph for hard negative mining. We show that the mined positive and negative image pairs can provide accurate supervisory signals for learning effective representations using Convolutional Neural Networks (CNNs). We demonstrate the effectiveness of the proposed unsupervised constraint mining method in two settings: (1) unsupervised feature learning and (2) semi-supervised learning. For unsupervised feature learning, we obtain competitive performance with several state-of-the-art approaches on the PASCAL VOC 2007 dataset. For semi-supervised learning, we show boosted performance by incorporating the mined constraints on three image classification datasets.
منابع مشابه
Unsupervised Visual Representation Learning by Graph-based Consistent Constraints Supplementary Material
1. We show cycle detection results using the proposed unsupervised constraint mining approach on the large-scale ImageNet 2012 dataset as well as three image datasets with generic objects (CIFAR-10), fine-grained objects (CUB200-2011), and scene classes (MIT indoor-67), respectively. 2. We show examples of easy negative image pairs (i.e., image pairs with large Euclidean distance in the feature...
متن کاملTitle: View{based Cognitive Map Learning by an Autonomous Robot
This paper presents a view{based approach to map learning and navigation in mazes. By means of graph theory we have shown that the view{graph is a suucient representation for map behaviour such as path planning. A neural network for unsupervised learning of the view{graph from sequences of views is constructed. We use a modiied Kohonen (1988) learning rule that transforms temporal sequence (rat...
متن کاملPerson Re-identification by Unsupervised `1 Graph Learning
Most existing person re-identification (Re-ID) methods are based on supervised learning of a discriminative distance metric. They thus require a large amount of labelled training image pairs which severely limits their scalability. In this work, we propose a novel unsupervised Re-ID approach which requires no labelled training data yet is able to capture discriminative information for cross-vie...
متن کاملDeep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning
Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...
متن کاملThe Study of The Bay of Mount Saint-Michel by Using Graph Theory in The Analysis of Satellite Images
In this paper, a new approach for mapping based on the concept of objects and relationships between these objects is proposed to take advantage from both supervised and unsupervased classification methods. On the one hand, objects obtained after a supervised classification are represented by an adjacency graph model. On the other hand, objects obtained after unsupervised classification are repr...
متن کامل